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METHOD, APPARATUS AND COMPUTER PROGRAM FOR 
SEARCHING MULTIPLE INFORMATION SOURCES 

Field of the Invention 

The present invention relates to information sources and more particularly to 
searching multiple machine-readable infomiation sources. 

Background 

String searching (e.g., by keyword or plirase) represents one of the most 
common forms of searching perfonned on macliine-readable infonnation sources or 
databases. Search strings may also be combined using Boolean operators to perform 
so-called Boolean searches. 

Successful seai'ching is generally dependent on an appropriate selection of 
search strings. For more specialised information sources, such as those relating to a 
specialised field or art, selection of suitable search strings requires knowledge of 
specific temis used in the particular field or art. Thus, searching the most relevant 
infomiation sources may not yield optimal results if the appropriate string is not 
selected as the basis for the search. One such specialised field is that of biomedical 
science. 

MEDLINE is a bibliogi'aphic database published by tlie U.S. National Libraiy of 
Medicine (NLS) that covers the fields of medicine, nursing, dentistry, veterinary 
medicine, the health care system, and the preclinical sciences. MEDLINE provides 
access to abstracts of articles and citations from more than 4,000 biomedical journals 
. published worldwide. 

The Medical Subject Headings (MeSH'^) is a controlled vocabulary produced by 
the NLS that may be used for indexing, cataloguing, and searching for biomedical and 
health-related infomiation and documents. Various online systems provide access to 
MeSH"®. Such systems include the MeSH"^ Browser, which contains the complete 
contents of the vocabulary, the MeSH® Entrez databases, which are designed to assist 
those searching MEDLINE or PubMED, and the UMLS Metathesaurus®, wherein the 
MeSH® vocabulary is combined with a number of other controlled vocabularies. The 
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UMLS Metathesaurus® is designed to facilitate retrieval and integration of 
infonnation from multiple machine-readable information sources such as descriptions 
of the biomedical literature, clinical records, factual databanks, knowledge-based 
systems, and directories of people and organisations and are specifically directed to 
developers of information retrieval systems. 

Numerous organisations offer access to the MEDLINE database with differing 
ways of searching the database. One such MEDLINE service is the PubMED seiTice 
offered by the U,S. National Library of Medicine (NLM). Another MEDLINE service 
using MeSH'^is offered by Ovid Technologies, Inc. 

Another bibliographic database that provides access to literature on 
pharmacology and bio-medicine is EMBASE, which is produced by Elsevier Science 
B.V. Various organisations offer access to the EMBASE database with differing 
searching methods and vocabularies. For example, Ovid offers access to EMBASE 
using the EMTREE vocabulary. 

As may be understood from the foregoing, numerous separate infonnation 
sources relating to the biomedical field are published worldwide as electronic 
resources or databases. However, major obstacles to the effective retrieval and 
integration of infomiation from multiple sources deter medical and health-care 
professionals and researchers from using available machine-readable infonnation. 
Such obstacles include: 

• the large variety of vocabularies and classifications used in different sources 
and by different users, and 

• the sheer number and wide distribution of potentially relevant information 
sources. 

Some existing mechanisms for seai*ching machine readable infonnation sources 
such as Ovid and PubMED provide a limited facility to map search strings to 
alternative search tenns, particulariy when multiple information sources are required 
to be searched. 

A need thus exists for improved methods, apparatuses and computer programs 
for searching multiple infomiation sources. 
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Summary 

According to an aspect of the present invention, there is provided a method for 
searching a plurality of machine-readable information sources. The method comprises 
the steps of: 

mapping a search string to a plurahty of search terms, wherein each search term 

relates to at least one of the plurality of information sources; 

indicating at least one infomiation source that each search temi relates to; and 
searching at least one indicated inforaiation source using selected ones of the 

search terms. 

According to another aspect of the present invention, there is provided an 
apparatus for searching a plurality of macliine-readable infomiation sources. The 
apparatus comprises: 

a communications interface for transmitting and receiving data; 

a memory unit for storing data and instructions to be performed by a processing 
unit; and 

a processing unit coupled to the communications unit and the memory unit, the 
processing unit programmed to: 

map a search string to a plurality of search tenns, wherein each search temi 
relates to at least one of the plurality of infomiation sources; 

output an indication of at least one infonnation source that each search teim 
relates to; and 

search at least one indicated information source using selected ones of the 
search temis. 

According to another aspect of the present invention, there is provided a 
computer program product comprising a computer readable medium having a 
computer program recorded therein for searching a plurality of information sources. 
The computer program product comprises: 

computer program code for mapping a search string to a plurality of search 
terms, wherein each search temi relates to at least one of the plurality of information 
sources; 
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computer program code for outputting an indication of at least one infonnation 
source that each search tenn relates to; and 

computer program code for searching at least one indicated infonnation source 
using selected ones of the search terms. 

Indication of an infomiation source that a search term relates to may comprise 
indicating which of a plurality of infonnation sources each search tenns relates to 
and/or indicating which vocabulary each search term is included in, wherein each 
vocabulary relates to at least one infonnation source. 

The search terms may be selected from a vocabulary of terms used in a related 
one of the plui-ality of information sources or from a meta-vocabulary comprising a 
list of ternis included in a plurahty of vocabularies. 

According to yet another aspect of the present invention, there is provided a 
method for searching a plurality of machine-readable infomiation sources comprising 
tlie steps of: 

mapping a search string to a plurality of search tenns, wherein each search tenn 
relates to at least one of the plurality of infonnation sources; and 

searching at least one infonnation source using selected ones of the seai'ch 

terms. 

Other aspects of the present invention comprise an apparatus and a computer 
program product for practising the foregoing method. 

Brief Description of the Drawings 

Existing and new embodiments are described hereinafter, by way of example 
only, with reference to the accompanying drawings in which: 

Fig. 1 is a screenshot showing input of a string to an Ovid searching tool; 

Fig. 2 is a screenshot showing a mapping display for the string input in Fig. 1; 

Fig. 3 is a screenshot showing results of a search of an Ovid-deUvered version of 
the EMBASE database; 

Fig. 4 is a screenshot showing a menu for changing database; 

Fig. 5 is a screenshot showing results of a search perfonned on an Ovid- 
delivered version of the MEDLINE database; 
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Fig. 6 is a screenshot showing a mapping display; 

Fig. 7 is a screenshot showing results of a search perfonned on an OVID- 
delivered version of the MEDLINE database; 

Fig. 8 is- a flow diagram of a method for searching a plurality of machine- 
readable information sources according to an embodiment of the present invention; 

Fig, 9 is a screenshot showing input of a search string to the Universal Search 
Environment (USE) searching tool; 

Fig. 10 is a screenshot showing a mapping display for the search string input in 
Fig. 9; 

Fig. 11 is a screenshot showing results of two searches performed on the Ovid 
MEDLINE and EMBASE databases, respectively; 

Fig. 12 is a screenshot showing a menu for changing database and results of a 
search perfonned on the Ovid EMBASE database; 

Fig. 13 is a screenshot showing results of two separate searches perfomied on 
the Ovid EMBASE databases; 

Fig. 14 is a schematic block diagram of a computer system with which 
embodiments of the present invention may be practised; 

Fig. 15 is a screenshot showing input of a search string to the Universal Search 
Enviromnent (USE) searching tool; 

Fig. 16 is a screenshot showing a mapping display for the sti'ing input in Fig. 15; 

and 

Fig. 17 is a screenshot showing a dropped-down instance of a field selection 
menu. 

Detailed Description 
A small number of embodiments are described hereinafter for searching a 
plurality of information sources. For ease of description, the embodiments are 
described with specific reference to medical sources or databases. However, it is not 
intended that the present invention be limited accordingly as the principles of the 
present invention have general applicability to numerous other machine-readable 
infomiation sources or databases. 
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The word "vocabulai-y", as used in the present specification, is intended to 
include both pubHshed and proprietary lists of words or terms within the scope 
thereof. A 'Vocabulary" may be generated based on tenns that are used in a particular 
database or may simply comprise a general list of temis used in a specific field or art. 

The word "tenn", as used in the present specification, is intended to include both 
words and plirases within the scope thereof A meta-vocabulary or meta-thesaurus 
typically comprises a consolidated list of terms that are or may be used in multiple 
information sources. *'Synonyms" or temis that have an equivalent conceptual 
meaning are typically grouped together as a "subject in a meta-vocabulary. Details of 
a source vocabulary from which a synonym originates are also typically stored in a 
meta-vocabulary. An "alternative subject" is another subject that is closely related but 
not identical to the original subject. 

The plirase "information source", as used in the present specification, includes 
both sti-uctured and unstructured databases within the intended scope thereof 
Examples of structured and unstnictured databases include bibliographic databases 
and machine-readable textbooks, respectively. 

Figs. 1 to 7 relate to an existing embodiment of a method for searching 
infomiation sources offered by Ovid Technologies, Inc. 

Fig. 1 shows input of the string "intestinal obstruction" 1 10 to Ovid. 

Fig. 2 shows mapping of the original- string 110 by Ovid to the search tenn 
"Litestine Obstniction" 210 using EMTREE. Ovid also offers a simple keyword- or 
phrase- type search based on the original string 110, which is shown as search term 
220 in Fig. 2. The ticks in the boxes to the left of the possible search ternis 210 and 
220 indicate user selection of the search term 210 and non-selection of the search terai 
220 for searching. 

Fig. 3 shows that 4581 matches resulted from searching the Ovid-delivered 
version of the EMBASE database using the search terni 310 from EMTREE, which 
corresponds to the seai'ch term 210 in Fig. 2. Activation of the display icon 320 by 
means of a pointing device causes the actual search results to be displayed. The 
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"Change Database" icon 330 may be activated to change from EMBASE to another 
database offered by Ovid. 

Fig. 4 shows a menu for changing from the EMBASE database to the 
MEDLINE database. Menu option 410 opens the MEDLINE database and re-runs the 
previous search history. Menu option 420 opens the MEDLINE database and clears 
the search liistory. Menu option 430 returns a user to the Main Search Page without 
changing databases. 

Fig. 5 shows the result of selecting menu option 410 in Fig. 4 and thus opening 
the MEDLINE database and re-executing the search using the same search term as 
that used in the previous search. Fig. 5 shows that zero matches were found by 
searching the OVID-delivered version of the MEDLINE database using the search 
term "Intestine Obstruction" 510 from EMTREE, which corresponds to the search 
tenn 210 in Fig. 2. The zero result is due to the fact that the search tenn 510 is not a 
MeSH® term for searching the MEDLINE database. 

Fig. 6 shows a list of subjects 610 for remapping the search temi "Intestine 
Obstruction", which corresponds to the search term 510 in Fig. 5. A user may select or 
deselect each of the various subjects 610 by ticking or un-ticking the boxes to the left 
of each subject. Fig, 6 shows only the subject "Intestinal Obstniction" 620 selected by 
way of the tick in the box to the left of the subject 620. The boxes relating to and to 
the left of the remaining subjects are un-ticked. 

Fig. 7 shows results of searches performed on the Ovid-delivered version of the 
MEDLINE database. Zero matches were foimd using the search term "Intestine 
Obstniction" 710 fi-om EMTREE, whereas 16615 matches were found using the 
search term "Intestinal Obstruction" 720 from MeSH®. 

Figs. 1 to 7 show that re-execution of a search on a different infonnation source 
using Ovid does not yield optimal results as the mapping of an original string to a 
plurality of alternative temis is not optimal for a different infonnation source. Optimal 
searching of a different infonnation source using Ovid thus requires the extra step of 
re-mapping the original string on a vocabulaiy related to, or used to index, the 
different information source. Fimhemore, Ovid disadvantageously fails to provide 
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any indication of the information sources or vocabularies the various subjects or 
search teims originate from or are related to. 

Fig. 8 is a flow diagram of a method for searching a plurality of machine- 
readable infoimation sources. 

At step 810, a search string is mapped to a plurality of search temis that are each 
included in at least one vocabulary relating to at least one of the plurality of 
infomiation sources. An indication of at least one infomiation source that each search 
term relates to is provided at step 820. Step 820 is an optional step in that it is not 
included in certain embodiments of the present invention. At least one indicated 
information source is searched at step 830 using selected ones of the search temis. 

The infoimation source/s that the search temis relate to is/are indicated to 
provide reassurance to a user that an appropriate mapping to search terms relating to 
desired vocabularies or infomiation sources is performed or available. The 
information source/s that the search teims relate to may be indicated by displaying 
references to one or more vocabularies related to each search tenia and/or one or more 
information sources related to each search temi, or both. As all of the search terms are 
preser\'ed across searches, additional searches may be perfoniied on multiple 
infonnation sources without the need for re-mapping of the search terms each time a 
different infonnation source is searched. 

Figs. 9 to 13 relate to an embodiment of the method of Fig. 8. 

Fig. 9 shows input of the search string "intestinal obstruction" 910 to the 
Unified Search Environment (USE), which comprises a computer software progi'am. 
Mapping of the search temi 910 is perfonned by user selection of a ''thesaurus" option 
920. Other options in place of the thesaurus option include a simple search using a 
keyword or phrase. The thesauins used by USE is based on the UMLS 
Metathesaurus®, which comprises its own set of temis, plus terms from a number of 
other vocabularies. 

Fig. 10 shows mapping of the subject 1010, which corresponds to the string 910 
in Fig. 9, to a set of synonyms 1020. As may be seen from Fig. 10, the term "Intestinal 
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Obstruction" comprises a preferred term for UMLS, Dxplain term and MeSH . 
Similarly, the term "ileus" comprises a prefeired temi for MeSH® and Dxplain, the 
tenn ^'Unspecified intestinal obstmction" comprises a preferred term for ICD9, the 
temi "INTESTINE, OBSTRUCTION" comprises a preferred term for Dxplain and 
EMTREE temi, and the tenns *'ileus of bowel" and **ileus of intestine" comprise 
prefeiTcd tenns for UMLS. The term "bowel obstruction" does not appear iii any of 
the vocabularies relating to the available databases. A user may select or deselect each 
synonym in the set of synonyms 1020 by "clicking" on the boxes to the left of the 
synonyms by means of a pointing device. 

One or more from a set of replacement subjects 1030 may be selected by a user 
to replace the list of synonyms 1010 for the currently mapped subject 1010. It is also 
possible for a user to add tenns from related subjects to the synonyms 1010 of the 
currently mapped subject 1010. 

UMLS, Dxplain, MeSH^, ICD9, and EMTREE comprise vocabularies for 
related databases. For example, MeSH® is a vocabulary used by KffiDLINE, 
EMTREE is a vocabulary used by EMBASE, and ICD9 is used in numerous medical 
record systems. 

Fig. 1 1 shows results of searches performed on the Ovid MEDLINE and Ovid 
EMBASE databases, respectively, using search terms 1110, 1130, which correspond 
to the multiple search tenns or synonyms 1020 selected in Fig. 10. The upper pane 
1170 and lower pane 1180 of the screenshot of Fig. 11 show search results from the 
Ovid MEDLINE and EMBASE databases, respectively. Searching the Ovid 
MEDLINE database yields 16641 matches 1120 and searching the Ovid EMBASE 
database yields 6441 matches 1140. The numbers of matches 1120 and 1140 shown in 
Fig. 11 are higher than the numbers of matches 320 and 740 shown in Figs. 3 and 7, 
respectively, on account of the additionally identified MeSH® search temi "Ileus" 
being searched. 

The ''Change Database" icons 1150 and 1160 may be activated to change 
database fi'om MEDLINE or EMBASE, respectively. 

Fig. 12 shows a menu for changing from the MEDDLINE database to the 
EMBASE database in the upper pane 1240. The lower pane 1250 corresponds to the 
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lower pane 1180 in Fig. 11. Menu option 1210 opens the EMBASE database and re- 
runs the previous search history (i.e., search history 1110, 1130 as shown in Fig. 11). 
Menu option 1220 opens the EMBASE database and clears the search history. Menu 
option 1230 returns a user to the Main Search Page without changing databases. 

Fig. 13 shows the results of a user selecting menu option 1210 to open the 
EMBASE database and re-execute the search using the previous search history. As 
can be seen from the upper pane 1310 of Fig. 13, re-searching the EMBASE database 
using the previous search history 1320 yields 6441 matches 1330. This search result is 
the same as the previous search result 1340 obtained from searching the EMBASE 
database, which is shown in the lower pane 1350 and con-esponds to the search result 
shown in the lower pane 1250 in Fig. 12, This search result is conditional on the meta- 
thesaurus being used comprising a super-set of the EMTREE vocabulary, which 
relates to the EMBASE database. 

Advantageously, no loss of quality/infonnation results from the user switching 
bet\veen databases on account of the manner in which USE constaicts mapped queries 
using multiple (potentially) redundant tenns. 

Searching an Information Source 

An embodiment of a method for searching an infonnation source or database is 
described hereinafter. 

A search string entered by a user is mapped to a subject. The method used in 
USE to perform this mapping comprises the following steps: 

1 . Find subjects with a tenn, which in thefr entirety consist only of the search 
string. 

2. If no match from step 1 is available, find subjects with a term differing from 
the search string only by a spelling variation. The algorithm published by 
Porter is used to perform this step. Additional information regarding the 
Porter algoritlmi may be found in the relevant literature or at the URL: 
<littp://www.tartarus.org/'-martin/PorterStemmer/>, the contents of which 
are included herein by reference. USE also allows users to oveiride the 
Porter steimning algoritlini, and instead match with a wildcard. For example, 
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Porter stemming will peimit the input string "arteries" to be matched to 
"artery" but not to "arthouse". However, the search string "art*" will match 
to both "artery" and "arthouse". Numerous other matching algoritluns 
including fuzzy matching algorithms such as Levenshtein Edit Distance 
matching score may also be practised. Additional infoniiation regarding the 
Levenshtein algoritlim may be found in the relevant literature or at the URL: 
<http://www.merriampark.coni/ld.htm>, the contents of which are included 
herein by reference. 

3. If no match from step 2 is available, find subjects witli a term containing the 
search string, but also possibly containing additional strings (e.g., if the 
string "Intestinal Obstruction" was not found in steps 1 and 2, then the 
subject "Intestinal Obstruction without hernia" could be matched. 

4. If no match from steps 1 to 3 is available, search the UMLS Metathesaurus®, 
which contains a brief definition of each tenn in the LMLS Metathesaurus'®. 

The foregoing method generates a list of possible candidate search temis. hi 
addition to ranking these candidates in the above four broad categories, further 
ranking within categories is perfomied on the basis of a similarity score. A vector 
cosine measure algorithm is typically used to calculate this score. Additional 
infonnation regarding the vector cosine measure algorithm may be found in the 
relevant literature or at the URL: litfp://www.cs.ust.hlc/facultv/dlee/PaDers/ir/ieee-sw- 
rank.pdf . the contents of which are included herein by reference. 

Optional Furth er Extension 

An optional fiuther extension to the embodiments described with reference to 
Figs. 8 to 13 is that search strings comprising multiple sub-strings may be mapped to 
muUiple search tenns in a single step. The search string is disassembled into multiple 
sub-strings but the mamier in which the sub-strings are combined is preserved. 

The disassembly process takes place by detemiining keyword or phrase 
boundaries. A dictionary of boundary strings that play a grammatical role in marking 
out of such boundaries in natural language is maintained, so that search strings that 
resemble human natural language may be submitted for searching (e.g., "potassium in 



wo 2005/096174 



PCT/AU2005/000454 



-12. 

treatment of intestinal obstmction"). An example of such a dictionary may comprise 
the set of words: "in", "with", "for", "and", "or", and "of. 

The keywords or plirases delimited by such boundaries are extracted and used as 
search strings for the subject matching algorithm described hereinbefore. Reference 
designators are substituted into the original search string in place of the extracted 
keywords or phi-ases. Additionally, each of the words that match entries in the 
boundary dictionary is replaced with a Boolean operator by a set of predetermined 
rules (e.g., the word "with" may be replaced with the operator "AND", and the word 
"and" may be (trivially) replaced with the operator "AND"). 

An example of disassembly of the input search string "potassium in treatment of 
intestinal obstruction" is presented hereinafter. Fig, 15 shows user input of the string 
"potassium in treatment of intestinal obstruction" 1510 to USE. Thereafter, string 
1510 is disassembled into keywords or phrases as follows: 

Kl. "potassium" 

K2. "intestinal obstruction" 

K3. "treatment" 

Substitution of the reference designators Kl, K2, and K3 for .the keywords or 
plirases in the string yields: 
"Kl AND K2 AND K3" 

The reference designators Kl, K2 and K3 are then mapped in the same manner 
as a single keyword or phrase and all three mappings 1610, 1620 and 1630 are 
simultaneously displayed, as shown in Fig. 16. The "Replace" and "Add" 
functionality described hereinbefore now operates on a specific reference designator 
Kl, K2 or K3 depending on the row in which the "Replace" or "Add" is selected. 

Finally, the search terms or synonyms selected by the user are re-inserted in the 
search string by replacement of the reference designators Kl, K2, and K3. 

Additionally, ALL the selection checkboxes next to the search tenns or 
synonyms may be de-selected. This resuhs in the term being dropped completely 
(e.g., if all synonyms of potassium are de-selected, the substituted search query is 
reassembled as "K2 AND K3", where K2 and K3 are the synonyms selected for the 
remaining tenns "intestinal obstmction" and "potassium"). 
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A further feature is that a field list is created for each subject. The fields in a 
field selection menu 1640 that a user selects from may be customised based on the 
subject entered. Fig. 17 shows a dropped-down instance of the field selection menu 
1640. Field selection occurs simultaneously with mapping, rather than as a separate 
step. 

Existing systems such as Ovid require manual disassembly and separate user 
entry of each of the sub-strings '^potassium'* (1), "intestinal obstmction" (2), and 
"treatment" (3). A separate mapping is perfomied for each, before manual reassembly 
by entry of tlie Boolean expression "1 AND 2 AND 3". 

Computer hardware and software 

Fig. 14 is a schematic representation of a computer system 1400 that can be used 
to practise the embodiments described herein. Specifically, the computer system 1400 
is provided for executing computer software that is programmed to assist in 
performing a method for searching a plurahty of machine-readable information 
sources. The computer sofhvare executes under an operating system such as MS 
Windows XP"^^* or Linux^'^ installed on the computer system 1400, 

The computer softwai-e involves a set of progi'ammed logic instructions that may 
be executed by the computer system 1400 for instructing the computer system 1400 to 
perfonn predeteraiined functions specified by those instructions. The computer 
software may be expressed or recorded in any language, code or notation that 
comprises a set of instructions intended to cause a compatible information processing 
system to perform particular functions, either directly or after conversion to another 
language, code or notation. 

The computer software program comprises statements in a computer language. 
The computer progi'am may be processed using a compiler into a binary format 
suitable for execution by the operating system. The computer program is programmed 
in a mamier that involves various software components, or code means, that perform 
particular steps of the methods described hereinbefore. 

The components of the computer system 1400 comprise a computer 1420, input 
devices 1410, 1415 and a video display 1490. The computer 1420 comprises a 
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processing unit 1440, a memory unit 1450, an input/output (I/O) interface 1460, a 
communications interface 1465, a video interface 1445, and a storage device 1455. 
The computer 1420 may comprise more than one of any of the foregoing units, 
interfaces, and devices. 

The processing unit 1440 may comprise one or more processors that execute the 
operating system and the computer software executing under the operating system. 
The memory unit 1450 may comprise random access memory (RAM), read-only 
memory (ROM), flash memory and/or any other type of memory knovm in the art for 
use under direction of the processing unit 1440. 

The video interface 1445 is comiected to the video display 1490 and provides 
video signals for display on the video display 1490. User input to operate the 
computer 1420 is provided via the input devices 1410 and 1415, comprising a 
keyboard and a mouse., respectively. The storage device 1455 may comprise a disk 
drive or any other suitable non- volatile storage medium. 

Each of the components of the computer 1420 is connected to a bus 1430 that 
comprises data, address, and control buses, to allow the components to conmiunicate 
with each other via the bus 1430. 

The computer system 1400 may be connected to one or more other similar 
computers via the communications interface 1465 using a communication channel 
1485 to a network 1480, represented as the Intemet. 

The computer software program may be provided as a computer program 
product, and recorded on a portable storage medium. In this case, the computer 
software program is accessible by the computer system 1400 from the storage device 
1455. Alternatively, the computer softsvare may be accessible directly from the 
network 1480 by the computer 1420. hi either case, a user can interact with the 
computer system 1400 using the keyboard 1410 and mouse 1415 to operate the 
prograimned computer software executing on the computer 1420. 

The computer system 1400 has been described for illustrative purposes. 
Accordingly, the foregoing description relates to an example of a particular type of 
computer system suitable for practising the methods and computer program products 
described hereinbefore. Other configurations or types of computer systems can be 
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equally well used to practise the methods and computer program products described 
hereinbefore, as would be readily understood by persons skilled in the art. For 
example, the methods and computer program products described hereinbefore can be 
practised using a handheld computer such as a Personal Digital Assistant (PDA) or a 
mobile telephone. 

Methods, apparatuses and computer program products have been described 
hereinbefore for searching a plurality of machine-readable infonnation sources. The 
foregoing detailed description provides exemplary embodiments only, and is not 
intended to limit the scope, applicability or configurations of the invention. Rather, the 
description of the exemplary embodiments provides those skilled in the art with 
enabling descriptions for implementing an embodiment of the invention. Various 
changes may be made in the function and arrangement of elements without departing 
from the spirit and scope of the invention as set forth in the claims hereinafter. 

(Australia Only) In the context of this specification, the word "comprising" 
means "including principally but not necessarily solely" or "having" or "including", 
and not "consisting only of. Variations of the word "comprising", such as 
"comprise" and "comprises" have correspondingly varied meanings. 



